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Experimental studies of the thermal denaturation of DNA yield a strong indication that the 
transition is first order. This transition has been theoretically studied since the early sixties, mostly 
within an approach in which the microscopic configurations of a DNA molecule are given by an 
alternating sequence of non-interacting bound segments and denaturated loops. Studies of these 
models neglect the repulsive, self-avoiding, interaction between different loops and segments and 
have invariably yielded continuous denaturation transitions. In this study we exploit recent results 
on scaling properties of polymer networks of arbitrary topology in order to take into account the 
excluded- volume interaction between denaturated loops and the rest of the chain. We thus obtain a 
first-order phase transition in d = 2 dimensions and above, in agreement with experiments. We also 
consider within our approach the unzipping transition, which takes place when the two DNA strands 
are pulled apart by an external force acting on one end. We find that the unzipping transition is 
also first order. Although the denaturation and unzipping transitions are thermodynamically first 
order, they do exhibit critical fluctuations in some of their properties. For instance, the loop size 
distribution decays algebraically at the transition and the length of the denaturated end segment 
diverges as the transition is approached. We evaluate these critical properties within our approach. 



I. INTRODUCTION 

The unbinding phase transition of the two complemen- 
tary strands of the DNA molecule has been a subject of 
continual interest for over four decades jl], |, |3[ ^, |[ ^, Q . 
In thermal denaturation this transition takes place when 
temperature is increased. In a typical experiment, a sam- 
ple containing DNA molecules of specific length and se- 
quence is prepared. The fraction of attached bound base 
pairs, 9, is measured through light absorption at a wave- 
length of 260 nm. At low temperatures all base pairs are 
attached to each other while at high temperature they 
are all unbound. Thus, 6 decreases from one to zero 
as the temperature is increased. For heterogeneous DNA 
molecules, containing both AT and GC base pairs, 9 does 
not decrease smoothly with temperature, but rather ex- 
hibits a multistep behavior. It consists of plateaus of 
various sizes separated by sharp jumps. This behavior is 
related to the fact that the GC bonds are stronger than 
AT ones. Thus, long domains with higher concentration 
of AT bonds will denaturate at lower temperatures. The 
resulting stepped structure of 9 is therefore characteristic 
of the particular DNA sequence. It thus yields statistical 
information on the sequence of the molecule under study. 
The denaturation process has been verified through elec- 
tron microscopy where denaturate loops and bound seg- 
ments have been observed directly The sharpness of 
the jumps indicates that the unbinding transition is first 
order. 

More recently, the introduction of new techniques 
such as optical tweezers and atomic force microscopy 
1^, [T^ has allowed the manipulation of single biological 
molecules. This made it possible to study a wider variety 



of physical properties of the DNA molecule. For exam- 
ple optical tweezers have been used to apply a force and 
pull apart the two strands at one end of the molecule. It 
is found that a phase transition takes place at a critical 
force where the molecule is unzipped and the two strands 
are separated |l^ . 

Thermal denaturation has been studied theoretically 
since the early sixties. The early models, which we re- 
fer to as Poland-Scheraga (PS) type models [|[ |^, con- 
sider the molecule as being composed of an alternating 
sequence of bound and denaturated segments. A bound 
segments is energetically favored over an unbound seg- 
ment, while a denaturated segment (loop) is entropically 
favored over a bound one. Within this approach the in- 
teraction between the different segments of the molecule 
has not been taken into account. This assumption simpli- 
fies the analysis considerably. The order of the transition 
is found to be determined by a parameter c which char- 
acterizes the statistical weight of a loop. The number 
of configurations of a loop of length £ behaves as s^/£'^ 
for large £. Here s is a non-universal constant. It has 
been shown ||l^ that the phase transition is first order if 
c > 2 and second order if 1 < c < 2 while for c < 1 no 
transition takes place and the strands are always bound. 
Using random-walk configurations to model a loop one 
finds that c = d/2 in d dimensions. The transition is thus 
predicted to be continuous in d = 2 and d = 3 dimensions 
| p2[ . This result is at variance with experimental obser- 
vations. The model was later generalized to take into 
account the self-avoiding interactions within each loop 
Ip^ , ^ . It is found that the loop entropy takes the same 
general form as before. However, the exponent c now 
takes the value dv, where ly the correlation length expo- 
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nent of a self-avoiding random walk. Inserting the known 
values for v one finds that although c is larger than that 
of a random- walk model, it is still smaller than two both 
in d = 2 and d = 3 dimensions, yielding a continuous 
transition. It was suggested that self-avoiding inter- 
actions between the various parts of the chain (and not 
just within loops) would further sharpen the transition 
possibly making it first order. However, theoretical tools 
for carrying out this analysis have not been available at 
the time. More recently, excluded volume interactions 
have been fully taken into account in a numerical study 
of finite chains These simulations strongly suggest 
that the transition is indeed first order. 

In a recent study we have extended the PS 

model to take into account self-avoiding interactions both 
within a loop and between a loop and the rest of the 
chain. To carry out the analysis of this model one has 
to enumerate the configurations of a loop embedded in 
a chain with self- avoiding interactions. This has been 
done by taking advantage of recent results obtained by 
Duplantier et al. |l^ for the number of configuration 
of a general polymer network. It is found that the statis- 
tical weight of a loop embedded in a chain has the same 
general form as before, namely j i'^ . However, the pa- 
rameter c is now modified and becomes larger than two 
in d > 2 dimensions. In particular one finds c ~ 2.115 in 
d = 3 and c = 2 -|- 13/32 in d = 2. Thus, self-avoiding 
interactions make the transition first order in two dimen- 
sions and above. Recently a different model in which 
excluded volume interactions were partially taken into 
account has been found to yield a first order transition 
. In this model excluded volume interactions between 
the two strands of the chain were explicitly considered, 
but those within each strand were neglected. 

In this work we present a detailed account of the re- 
sults obtained when the interactions between various seg- 
ments along the chain are taken into account using the 
scaling results of Duplantier. We show that the denatu- 
ration transition is first order. However, the transition is 
found to be accompanied by critical fluctuations in some 
the chain's properties. For example, the loop size distri- 
bution is found to decay algebraically at the transition. 
Indeed, the probability distribution for loops of length ^, 
P(£), behaves as P{i) ~ l/^'^ at the transition. This be- 
havior was recently confirmed in numerical simulations of 
the model where the excluded volume interactions have 
been taken into account fully. The value of the measured 
exponent c ~ 2.10 ±0.02 agrees well with our predictions 
pO| . We also find that when the boundary conditions 
are such that the chain is open at one end the length ^ of 
the end segment diverges as 1/|T — Tm| when the melting 
temperature Tm is approached. 

We have extended the model to consider the unzipping 
transition which takes place when a force of magnitude 
/ is applied to separate the two strands. We find that 
this transition is first order. We also find that the end 
segment length diverges as ^ '--^ 1/|/ — /u| where /u is 
the unzipping critical force. This behavior has been pre- 



viously found in models where self avoiding interactions 
were not taken into account |2^, ^ |2J, In cal- 
culating the critical force near the melting transition we 

find that /u ^ ir-rMT- 

The paper is organized as follows. In Section II we 
consider the thermal denaturation transition in detail. In 
Section IIA we review the analysis of Poland and Scher- 
aga and that of Fisher where self-avoiding interaction 
within a loop is taken into account. In Section IIB we 
analyze the model with full account of self-avoiding in- 
teractions. The length distribution of the end segment is 
considered in section IIC and a summary and overview 
of other approaches to thermal denaturation of DNA is 
given in Section IID. In Section HI the unzipping tran- 
sition is studied. A brief summary is given in section 
IV. 



II. THERMAL DENATURATION OF DNA 

A. The model and basic analysis 

The model considers two strands, each composed of 
monomers. Each monomer represents one persistence 
length of a single unbound strand. Typically this is about 
- 40 A l2l, or roughly 8 bases. We set the boundary 
conditions such that the monomers at one end of the 
molecule are bound. Such a boundary condition is nec- 
essary for a bound state between the two strands to exist. 
All other monomers on the chain can be either unbound 
or bound to a specific matching monomer on the second 
chain. The interactions between a monomer and other 
monomers on the second strand or on the same strand 
arc ignored. The binding energy iSg < between match- 
ing monomers is taken to be the same for all monomer 
pairs. 

A typical DNA configuration is shown in Fig. ^ It 
is made of an alternating sequence of bound segments 
and denaturated loops. The configuration ends with two 
denaturated strands. For simplicity, the configurational 
entropy of a bound segment associates with its embed- 
ding in ambient space is neglected. It is easy to verify 
that this assumption does not affect the nature of the 
denaturation transition obtained within this model. The 
statistical weight of a bound sequence of length i is then 
given by — Gx^{—iEQlT\ where T is the tempera- 
ture and the Boltzmann constant fee is set to 1. Thus, 
w is a decreasing function of the temperature. On the 
other hand, a denaturated loop does not carry an energy 
and its statistical weight is derived from its degeneracy. 
In this model it is assumed that the loop is fully flex- 
ible, and thus it is described by a random walk which 
returns to the origin after 21 steps. Considering all pos- 
sible such walks the statistical weight for large I has the 
form ri(2£) = As^ ji'^ ^ where s is a non-universal constant 
and the exponent c is determined by the properties of the 
loop configurations. For simplicity, we set A = \. Finally, 
the statistical weight of the end segment, which consists 
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FIG. 1: Schematic representation of a microscopic configura- 
tion of the DNA molecule. 




FIG. 2: Graphic illustration of the expansion of the partition 
function (1) in U{z)V[z). Each segment of type Uo,U,V or Q 
represents a sum over all possible lengths of its type weighted 
properly with a fugacity. 



of two denaturated strands each of length £, takes the 
form A{2i) = Bs^ jl"^ for large where c is in general 
not equal to c. Again, for simplicity we set _B = 1. The 
values of the exponents c and c will be held arbitrary for 
the moment. We shall later discuss them in detail. 

Using the weights assigned to each segment of the chain 
the total weight of any given configuration may be calcu- 
lated. For example the weight of a chain which consists 
of a bound segment of length £i, a denaturated loop of 
length £2, a bound segment of length ^3, and a pair of 
denaturated strands of length £4, is given by 



w 



A(24). 



(1) 



The statistical weight of a more general chain configu- 
ration made of p alternating bound segments and de- 
naturated loops will have the same form with a suitable 
number of factors of the form ■ u'^'+^ before the 

end-segment weight A(2^p). 

The model is most easily studied within the grand 
canonical ensemble where the total chain length L is al- 
lowed to fluctuate. The grand canonical partition func- 
tion, is given by 



L=0 



Vo{z)Qiz) 
-U{z)V{z)' 



(2) 



where Z(L) is the canonical partition function of a chain 
of length L, z is the fugacity, the functions U{z), V{z) 
and Q{z) are defined by 



U{z) 
Viz) 



00 p 

— s 



E 



w z 



Q{z) = l + £A(2V-l + Ei' 



1=1 



= l + $e(zs), 



(3) 
(4) 

(5) 



and V{){z) = 1 + V{z). In this equation, $c(-z) is the 
polylog function whose basic properties are summarized 
in Appendix Equation (|^) can be verified by expand- 
ing the partition function as a series in U{z)V{z). The 
factors V{){z) and Q{z) properly account for the bound- 
aries. A graphical illustration of the series expansion in 
U{z)V{z) is given in Fig. |[ To set the average chain 
length, (L), one has to choose a fugacity such that 



{L) = dXnZ/dXnz. 



(6) 



This implies that the thermodynamic limit {L) 00 in 
obtained by letting z approach the lowest fugacity z* for 
which the partition function (^) diverges. This can arise 
either from the divergence of the numerator or from the 
vanishing of the denominator. The relevant situation, at 
low temperature, is the second one, which corresponds to 
z* satisfying 



U{z*)V{z*) = 1. 
Since V{z) ~ wz/{l — wz) this reduces to 
U(z*) = l/(wz*) - 1. 



(7) 



(8) 



We shall see that above the transition, namely in the 
denaturated phase, the numerator diverges. Moreover, 
when one considers the problem of DNA unzipping by 
applying an external force on the strands, a divergence 
arising from a boundary factor will play an important 
role. 

The fraction of bound monomer pairs 9 is the experi- 
mentally measured quantity and the order parameter of 
the transition. Its temperature dependence in the ther- 
modynamic limit (L) — > 00 can be calculated from the 
behavior of z*{w). The average number of bound pairs 
in a chain is given by (to) = 9 In 2/9 In w, so that 



9 = lim 



(to) 



d\nz* 
dhiw 



(9) 



Thus the nature of the denaturation transition is deter- 
mined by the temperature dependence of the fugacity 
z*(w). This behavior can be classified into three distinct 
regimes depending on the value of the exponent c. These 
regimes are most easily understood through a graphical 
solution of (^. 



Case (i): c < 1. No phase transition 

A schematic representation of the graphical solution of 
(^) in this case is given in Fig. ^. The function U{z) is 
finite for any z < zm — 1/s. Since the sum 



00 

f^(v^) = E7^' 



(10) 



1=1 



diverges for c < 1, the function U{z) increases smoothly 
to infinity as z approaches 1/s. For a given value of z the 
function 1/V{z) increases as the temperature increases. 



4 




In Fig. 1^ 1/V{z) is plotted vs. z for three values of w. 
One can see that as the temperature is increased the 
crossing point z* of the two graphs increases smoothly 
until it saturates a,t w = Woc (T = oo). Therefore 6 
decreases smoothly as temperature is increased and no 
phase transition takes place. In this case the strands are 
always bound at all temperatures. 



Case (a): 1 < c < 2. Continuous phase transition 

A schematic representation of the graphical solution 
of Eq^^ in this case is given in Fig. ^. Here the 
sum (IIO) is finite at z = zm = 1/s since c > 1. 
The function U{z) = <I>c(zs) increases smoothly to a fi- 
nite value as z approaches zm and becomes infinite for 
z > Zm. In Fig. ^ the function 1/V{z) is plotted for 
three values of w. One can see that as temperature is 
increased z* increases until it reaches 1/s at w = wm- 
Above the transition, for w < wm, z* remains equal to 
1/s in the thermodynamic limit. Here, the (L) oo 
limit is obtained through the divergence of the factor 
Q{z = 1/s) in the numerator. A more careful analysis 
of the denaturated regime is presented in Appendix 
Note that for a transition to take place one must have 
l/V{z = l/s,w = 1) > [/(1/s). Otherwise there is no 
phase transition and the two strands are bound at all 
temperatures. Since c < 2 the derivative of U{z) diverges 
at the transition. This implies that 9 — d\n z* /dlnw ap- 
proaches zero continuously, yielding a continuous transi- 
tion. Since the derivative din z* /dlnw decreases with 
increasing c, the closer c to two, the sharper the transi- 
tion. 



Case (Hi): c > 2. First order phase transition 

A schematic representation of the graphical solution of 
Eq. (|^) in this case is given in Fig. ^. Here both U{z) and 
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FIG. 5: A typical behavior of the functions U and 1/V for 
c > 2 (here c = 2.5). See text for explanation. 



its derivative are finite at z = zm = 1/s. As in the previ- 
ous case there is a transition for w — wm- However, since 
the derivative of U{z) is finite, 9 approaches a finite value 
as the transition is approached from below. Above the 
transition 9 vanishes in the thermodynamic limit. The 
transition is therefore first-order. The discussion of the 
high-temperature phase is again deferred to Appendix 

To summarize, three different scenarios exist depend- 
ing only on the value of c. These are 



c < 1 
1 < c < 2 
c> 2 



no phase transition; 
continuous phase transition; 
first-order phase transition. 



The nature of the phase transition is thus directly related 
to the number of configurations of long denaturated loops 
within the chain. In the early studies of this problem the 
exponent c was evaluated by considering the number of 



random walks which return to the origin as modehng the 
loop configurations It is easy to show that in d 

dimensions the model yields c d/2. This implies that 
there is no transition for d < 2, a continuous transition 
for 2 < d < 4 and a first-order phase transition for d > 
4. The model was subsequently extended to include the 
repulsive short range interaction which exists between the 
strands constituting a loop. In this approach the loop is 
modeled as a self-avoiding walk Q. This yields c = dv, 
where v is the exponent associated with the radius of 
gyration Rq . For a self avoiding random walk of length 
L one has Rq ^ , with = 3/4 in d = 2 and w 0.588 
in d = 3. This yields c = 3/2 in d = 2 and c — 1.766 in 
d = 3. Thus the transition is continuous in both cases, 
although it is sharper than when the repulsive interaction 
is neglected altogether. 

The two estimates of the exponent c described above 
treat the loop as an isolated object and thus neglect its 
interaction with the rest of the chain. This simplification 
seems essential, since the formalism of the model relies 
on the segments composing the chain as being indepen- 
dent. In the next section we show that the repulsive 
interaction between a loop and the rest of the chain may 
be accounted for. Although we treat these interactions 
only in an approximate way, we are able to give insight 
into the unbinding mechanism and on the nature of the 
transition. 



B. Excluded- volume effects 

To account for the excluded volume interactions be- 
tween a loop and the rest of the chain we note that a mi- 
croscopic configuration of the DNA molecule is composed 
of many bound and unbound segments of various length. 
When evaluating the number of available configurations 
of a loop, one has to take into account the interactions 
with all these bound and unbound segments. Here we 
simplify the problem and neglect the internal structure 
of the rest of the chain. We thus consider a loop embed- 
ded in a flexible chain (see Fig. ^) and study the number 
of configurations of a chain endowed with this topology, 
assuming that it is self avoiding. We will show that in the 
limit where the loop length, 2£, is much smaller than the 
length of the rest of the chain, 2L, the statistical weight 
of this topology can be written as a product of the sta- 
tistical weights of the loop and of the chain. The weight 
of the loop is found to be of the same form as that of a 
free loop but with a different exponent c. This exponent 
is found to be larger than 2 in dimensions 2 and above, 
yielding a first order denaturation transition. 

To carry out this analysis we use results obtained by 
Duplantier et al jT^, |l^ for the number of configurations 
of polymer networks of arbitrary topology. In order to 
make the paper self contained we first review in some 
detail these results. They represent an extension of the 
well known results for the number of configurations of a 
simple self-avoiding random walk p^. In this case it is 
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FIG. 6: The topology of the loop embedded in a chain. The 
length of the chain from a vertex of type VI to the nearest 
vertex of type V3 is L. The length of each of the two strands 
connected to the V3 vertices is £. 




FIG. 7: An example of a polymer network. The network has 
four vertices Vi of order 1, two vertices V3 of order 3 and one 
vertex Vi of order 4. It also has one loop. Thus in Eq. ^ 
ni = 4, ns = 2, n4 = 1 and C = 1. 



known that the number of configurations scales as 

Flincar L^-\ (11) 

where L is the length of the polymer, s is a non-universal 
constant and 7 is a universal exponent. The exponent 7 
is known exactly in d = 2, numerically in d = 3 and via an 
e expansion in d = 4 — e. Above d = 4 self-avoiding inter- 
actions becomes irrelevant and thus the number of con- 
figurations of self-avoiding random walks scales as that 
of ordinary random walks, yielding 7=1. 

The generalization of this result to an arbitrary poly- 
mer network goes as follows: Consider a branched self- 
avoiding polymer Q of arbitrary topology (see for exam- 
ple Fig. ^ . The polymer is made of N chains of lengths 
£i,£2, ■ ■ ■ ^N- These are tied together at vertices with dif- 
ferent number of legs. A vertex with k legs is said to be 
of order k {k > 1). The number of vertices of order k 
is denoted by Uk- The number of configurations of the 
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network, Fg, is then given by 

r,.. (|,|,..., I). a.) 

where L = it is the total length of the network and 
(7 is a scaling function. This expression is valid asymp- 
totically as L ^ cxD, as long as all £i's diverge with the 
same rate. We note that the relation is valid also when 
the persistence length of each of the chains composing 
the graph is different. This variance can be absorbed 
through a rescaling of the chains lengths li. The expo- 
nent 7g depends only on the topology of the network and 
is given by 

7g = 1 - i^d/: + ^ nfeCTfc. (13) 
fc>i 

Here C is the number of independent loops in the net- 
work, d the spatial dimension and v is the exponent re- 
lated to the radius of gyration of a self-avoiding random 
walk. Each fc-vertex contributes its scaling dimension 
CTfc. Since vertices of order k appear times we ob- 
tain the term UkCfk- Specifically, if all lengths £i scale 
in the same way, the number of configurations scales as 
Tg - s^LTs-i. The scaling dimensions Cfe, defined for 
k > 1, are known exactly in d — 2 from conformal invari- 
ance: 

cTfe = (2- A:)(9A: + 2)/64, (14) 

and to order in d = 4 — e: 

(Tfc = ie/8){2-k)k/2 

+ (e/8)2fc(fc - 2)(8fc - 21)/8 + 0{e^). (15) 

Also, good estimates for the values of the exponents in 
d = 3 are available through Fade and Fade-Borel approx- 
imants. Clearly (72 = as one would expect and above 
d — A, where the self-avoiding interaction is irrelevant, 
all the exponents ak are zero. 

Consider now the topology depicted in Fig. ||. We are 
interested in finding the number of configurations of the 
network in the limit £ <^ L, when the loop size is much 
smaller than the length of the rest of the chain. Using 
the results by Duplantier (see Eq. (p^)), the number of 
configurations can be written as 

T ^ s^+^iL + ey^-^^-^gii/L), (16) 

for large L and £. Here g{x) is a scaling function and 
7ioop can be evaluated using Eq. (p^. For the topology 
considered above of a loop embedded in two segments 
(Fig. |6|) we have: one loop, C = 1; two vertices of order 
one, ni — 2, corresponding to the two free ends of the 
chain (denoted by VI in the figure); and two vertices of 
order three, = 2 (denoted in the figure by ^3). Using 
Eq. (|l|) we obtain 

7ioop = 1 - di^ + 2(71 + 2(73. (17) 



The limit of interest is that of a loop size much smaller 
than the length of the chain, l/L <^ 1. Clearly, in the 
limit l/L 0, the number of configurations should re- 
duce to that of a single self-avoiding open chain, which, to 
leading order in L, is given by s^L''^^ , where 7 — l-\-2ai. 
This implies that in the limit x 

g{x) - x^i-p-T. (18) 

Thus the ^-dependence of F, which gives the change in the 
number of configurations available to the loop, is given 
by 

r _ S^^7loop-7 . gi^.'^-l, (-19) 

It is therefore evident that, for large i and L and in the 
limit t/L <^ 1, the partition sum is decomposed into a 
product of the partition sums of the loop and that of 
the rest of the chain. The excluded volume interaction 
between the loop and the rest of the chain is reflected 
in the value of the effective exponent c. This result is 
very helpful since it enables one to extend the Poland- 
Scheraga approach described in the previous section to 
the case of interacting loops. From Eq. (|l^) one sees that 
the appropriate effective exponent c is given by 

c = 7 - 7100P cfi^ - 2(73. (20) 

In d = 2, (73 = -29/64 Q and = 3/4, yielding 

c = 2 + 13/32. (21) 

In (f = 4 - e to 0(e2), one has (73 = -3e/16 + 9eV512 
and V = 1/2(1 + e/8 + 15/4(e/8)2), yielding 

c = 2 + £/8 + 5£V256 . (22) 

In d = 3, one may use Fade and Fade-Borel approxima- 
tions to obtain (73 « —0.175 |l8| which with the value 
V K. 0.588 yields 

c« 2.115. (23) 

The value of the exponent c is unaffected by the different 
persistence length of a bound and unbound DNA seg- 
ment. This is since as stated above Eq. (^2|) is valid also 
when the persistence length of different polymers com- 
posing the network are different. Equation ( |20| ) can be 
understood intuitively by remarking that taking the limit 
i/L corresponds to shrinking the loop. By doing so 
one loop and two vertices of order 3 are eliminated. The 
exponent c is the difference between the exponent 75 of 
the network after shrinking the loop and the same ex- 
ponent before the loop has shrunk. Therefore c gets a 
contribution of dv from the eliminated loop and —2(73 
from the two vertices of order 3. 

As stated above the rest of the chain is in fact com- 
posed of both bound and unbound segments. This struc- 
ture has been neglected in the above analysis. To esti- 
mate the effect of the denaturated segments we consider 
the extreme case in which the rest of the chain is fully 
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FIG. 8: An extreme topology where the loop of length 21 
is embedded in two denaturated loops of size 2L each. The 
vertices of order 4 are denoted by Vi. 



of the exponent c is altered to ( p^ ) (see Fig. ||) when one 
of the segments connected to the loop is short. This may 
be verified by noticing that as the loop is shrunk a vertex 
of order 4 and one loop are lost. Note that this does not 
imply that these values of c can be used to give bounds 
on the true partition function of the model. 



C. The end segment distribution 



denaturated. That is, a loop embedded within two large 
loops each of size 2L (see Fig. ||). An analysis similar to 
the one presented above yields for the value of c, 



= 2 + 11/16, 

= 2 + £/4- 15eVl28 



in d = 2 
in d = A 



(24) 
(25) 
e, (26) 



where the values 0-4 = —19/16 in d = 2 and (T4 = — e/2 + 
ll(£^/8)^ in d = 4 — e dimensions are used along 
with those of v. Using (T4 w —0.46 obtained by Fade 
and Fade-Borel approximations gives in d = 3 the value 
c w 2.22. Therefore, the effect of the extra excluded 
volume interaction is to increase the value of c. It is easy 
to check that the exponent governing the ^-dependence 
remains unchanged even if the end points of one or both 
the outer loops are set to be unbound. 

In both topologies considered above the value of the 
exponent c is larger than 2ind = 2,o? = 4 — e and d — 3 
and strongly suggests that the transition is first order for 
any d > 2. Our analysis assumes that the size of the 
loops in the system is much smaller than the total chain 
length. The fact that the transition is first order implies 
that the loop size remains finite. This makes the analysis 
self-consistent. Note that the loop size distribution, P{i), 
is rather broad at the transition and behaves, for large i, 
as 



Pi£) 



1 



(27) 



Thus high enough moments of the loop size distribution 
always diverge. Although the transition is first order for 
c > 2 it exhibits some critical properties. Our analysis 
suggests that 2 < c < 3, so that already the variance of 
the loop size is predicted to diverge. A recent numerical 
study of the loop size distribution, where the excluded 
volume interactions have been fully taken into account, 
has verified this prediction with a measured value of c = 
2.10 ±0.02 lo). 

The exponent c is an effective exponent calculated by 
assuming a specific structure of the whole chain. In fact 
for any configuration, as long as the bound segments con- 
nected to the denaturated loop under consideration are 
long, the value of the exponent c remains the same as that 
given by (pO| ) (see Fig. |^). This is due to the fact that 
shrinking the loop eliminates one loop and two vertices of 
order 3. This result is also easy to verify using a scaling 
argument similar to the one presented above. The value 



In the previous section it was argued that the excluded 
volume interactions between a denaturated loop and the 
rest of the chain cause the transition to be first order. As 
was noted this result is unaffected by the boundary terms 
Vo{z) and Q{z) (see Eq. (||)). However, when the DNA 
molecule is fully denaturated, its statistical properties are 
determined by the boundary term Q{z). This is evident 
as the entropy of a denaturated molecule with free ends is 
much higher than that of one with the ends constraint to 
meet each other. Namely, the number of configurations of 
a self-avoiding random walk is much higher than that of 
a self-avoiding random walk which returns to the origin. 
Thus, one expects the average length of the end segment 
to diverge at the transition. This does not affect our 
previous results as long as the thermodynamic limit, L — *■ 
cxD, is taken before the temperature T approaches the 
melting temperature Tm- In this case the size of the end 
segment is always much smaller than the rest of the chain. 

The average length of the end segment is given by 



dlnQ 



dz 



(28) 



where z* is evaluated using Eq. (0). The behavior of ^ 
near the transition takes one of three forms depending 
on the numerical value of c: 

(i) c < 1. In this case Q{sz)_ = 1 + <f>c(sz) diverges like 

zm = 1/s. Clearly 



zm\ 



(see Appendix lA[) as z 



its derivative with respect to z diverges like |z — zmI*^ 
and hence the average length of the end segment diverges 
like f ^ |z* - zmI""^. 

(ii) 1 < c < 2. Here Q{sz) is finite but its derivative 
diverges like |z — zmI*^"^ as z ^ zm — 1/s. Thus, the 
average length of the end segment diverges like |z* — 

^ |5-2 

zm\ 

(Hi) c > 2. In this case both Q{sz) and its derivative are 
finite at the transition. Hence the end segment length is 
finite at the transition. 

We next use the temperature dependence of z*(w) in 
order to evaluate the behavior of the end segment length 
^ as a function of the temperature near the transition. 
Here one finds two regimes depending on the value of the 
exponent c related to the entropy of the bulk loops . 
To derive these relations we note that for z* close to zm 
Eq.([A^) yields for c > 1, 



U(zm)-U{z*)^\zm-z*\<, 



(29) 
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where C = min(l, c — 1). Using u>(z) = + /7(z))] ^ 
from Eq. (Kl) we obtain 



U{zm) - U{z*) 



1 



1 



(30) 



wmZm w{z*)z*' 
where wm = w{zm)- Rewriting this expression yields 

- z*\'' awuiz* — zm) + a{w{z*) - wm)z* , (31) 

where a = 1/{wmzmw{z*)z*) is finite at the transition. 
Clearly w{z*) — wm Tm — T near the transition, where 
Tm is the melting temperature and T is the temper- 
ature corresponding to w{z*). Thus, Eq. ( |3l| ) yields 
two regimes for the temperature dependence of z* . For 
1 < c < 2 the linear term in (z* — zm) in Eq.(pTl) is neg- 
ligibly small compared to the left hand side and hence 

ZM-z* ^\T-Tm\"^^-^\ (32) 
On the other hand for c > 2 C = 1 and therefore 



\T-T, 



Ml 



(33) 



Thus, the behavior of the end segment length ^ de- 
pends on both exponents c and c. We have already shown 
that c > 2. We now turn to estimate the exponent c. This 
exponent is associated with the degeneracy , s^/^'^, of an 
end segment of length and its value may be deduced us- 
ing the scaling argument presented above. One considers 
a Y-fork topology where the two denaturated strands are 
much smaller than the bound segment representing the 
rest of the chain. Using the terminology introduced in the 
preceding subsection one has to consider the difference of 
the exponent corresponding to the Y-fork topology and 
the one corresponding to linear chain. This yields 



-(O-I + (Ts)- 



(34) 



In d = 2, (Ti = 11/64 and cts = -29/64, so that c = 
9/32. In d = 3, one may use the Pade and Pade-Borel 
approximation results ci ~ 0.083 and (T3 ~ —0.175 to 
obtain c ~ 0.258. Finally in d = 4 - e, c = e/8 -I- ©(e^) 
while above four dimensions clearly c ~ Q. This suggests 
that the exponent c is smaller than one for any d > 2. 

Since the degeneracy exponents satisfy c > 2 and c < 
1 the analysis presented above suggests that near the 
melting transition the end segment length diverges like 



1 



|t-Tm| 



(35) 



D. Summary and overview of thermal denaturation 

In the preceding subsections we reviewed the analy- 
sis of Poland and Scheraga, showing that the nature of 
the denaturation transition is governed by the exponent 
c associated with the entropy of a denaturated loop em- 
bedded in a DNA molecule. It is found that for c < 1 



there is no transition, for 1 < c < 2 there is a continuous 
transition, while for c > 2 the transition is first order. 
Using recent results for the number of configurations of 
a self avoiding polymer network of general topology we 
have demonstrated that c > 2 in d = 2 dimensions and 
above. While the treatment is approximate the results 
strongly suggest that the transition is indeed first order 
in these dimensions. The results are unchanged even if 
the different persistence length of the bound and denat- 
urated configurations is taken into account. 

The analysis of the length distribution of the end seg- 
ment shows that the molecule melts from the unbound 
end. The boundary condition used in this analysis is such 
that the two strands are bound at one end. It is easy to 
show that when the boundary condition is modified such 
that the two strands are bound at the center rather than 
at one end, melting takes place from both ends. 

It is interesting to consider the case of a homopolymer 
where all the bases on one strand are the same. For 
example a molecule in which one strand is made only of G 
bases and the other strand is made only of C bases. Here 
each base, and therefore a monomer in the model, can 
bind to any other base on the other strands and not just 
to a specific monomer, as is the case for a heteropolymer. 
The two arms of a loop need not be of the same length. 
Thus the number of configurations of a loop of length I 
changes by a factor of This amounts to using ^ ■ ji'^ 
for the weight of a loop, effectively reducing the exponent 
c to c — 1. Since we have shown that 2 < c < 3 for d > 2, 
this implies for a homopolymer the effective c is smaller 
than 2 but greater than 1. The transition in this case is 
thus expected to be continuous. 

Finally, we comment on recent attempts to account 
for the first-order nature of the denaturation transition 
without having to resort to excluded volume interactions. 
Within this approach the two strands are considered as 
directed polymers and thus they do not self intersect. Let 
V{r) the interaction potential between the corresponding 
monomers on the two strands. It is repulsive at short 
distances, has an attractive well at the characteristic pair 
bond distance and it tends to zero at large distances. 
Using a transfer matrix approach, the thermodynamic 
properties of the chain are obtained by the ground state 
ipo (^) wavefunction of a Schrodinger equation which takes 



the form |27 



2m (7^ 



(36) 



Here eg is the ground state eigenvalue associated with 
the wave function ipQ (r) . The mass m represents the stiff- 
ness of the chain. The r-dependence of the mass is taken 
to account for the change of stiffness between bound and 
unbound strands 0. One usually assumes that the po- 
tential U(r) is short ranged (exponentially decaying with 
r at large distances) and that m{'r) varies with r over the 
same distance. The probability density of finding the 
strands a distance r from each other is given by \^po{r)\'^. 



9 



In this language the order parameter 9 is given by 



(37) 



where a is the range of the binding potential V{r). Here, 
for simplicity, we assume that neither m(r) nor V{r) de- 
pend on the orientation. 

The denaturation transition occurs when eo reaches 
zero so that there is no longer a bound state in the sys- 
tem. A first order transition occurs if has a non-zero 
value at that point. In this case one should have a bound 
state with energy eq = 0. Note that although 9 might 
jump at the transition, the average distance between the 
strands, which is not the order parameter of the system, 
might diverge continuously. Thus, we are interested in 
the solution of the equation 



VVo(?^) + 2m{r)V(r)^o{f) = 0. 



(38) 



Since V{r) decays exponentially with r, the asymptotic 
behavior of V'o('^) large distances is obtained by the 
equation 



V^V'olr) -0. 



(39) 



It is easy to show that this implies '0o(^) ^ for 
large r. That is, the wave function for large r is normal- 
izable only for d > 4 so one has a localized zero energy 
ground state. Therefore using Eq. ( |37| ) one can see that 
for any dimension d < 4 the order parameter 9 vanishes 
at the transition as is expected for a continuous transi- 
tion. A first-order phase transition occurs only for d > 4. 
Note that this result is equivalent to using the Poland- 
Scheraga model with random walks modeling the loop 
entropy. Within this approach the transition may thus 
be altered to first order in lower dimensions o«Zw when 
V{t) are long-range, in contrast to recent claims 

Recently, it has been argued that taking into account 
excluded-volume interactions between the two strands 
while neglecting these interactions within each strand ef- 
fectively leads to a long range potential between the two 
strands This model, which takes into account the 

excluded volume interactions only partially also leads to 
a first order transition. 



III. THE UNZIPPING TRANSITION 

The introduction of new and powerful techniques, such 
as optical tweezers Q and atomic force microscopes , 
has made possible the manipulation of single biological 
macromolecules. A number of experiments have investi- 
gated the response of double-stranded DNA to external 
forces and torques ||2^. Recently, it has become possible 
to apply and measure a force pulling apart two strands 
of a DNA double helix Previous theoretical studies 
of the unzipping transition have been carried out using 
the directed polymer approach where self-avoiding inter- 
actions are not accounted for lEll, ^ |23, |25tl . In most 



of these studies the unzipping of homopolymers has been 
analyzed. Heterogeneous chains have also been consid- 
ered in some of these studies In this section we 
extend the analysis of the PS model to consider the un- 
zipping of homopolymers with self-avoiding interactions. 
We show that the unzipping transition is first order. We 
also calculate the dependence of the critical unzipping 
force on the temperature at low forces, namely near the 
melting temperature. 

We consider a configuration where the corresponding 
monomers at one end of the chain are bound together, 
while a force / is applied on the two monomers at the 
other end of the chain, pulling the two strands apart. In 
this setup, the grand canonical partition function takes 
the form 



Z 



Vo{z)0{z) 

1 - u{z)v(zy 



(40) 



where the factor 0{z) is the grand partition function of 
the open tail under force. We have 



0{z) = l+^Ze„d(^)^ 



(41) 



where 2^cnd(^) is the canonical partition function of an 
open end composed of two strands, each of length I. 

To evaluate Zcnd(^) we note that when no force is ap- 
plied the partition sum takes the form 



Ze„d(^)=A(2^)~-, 



(42) 



where, as discussed in sec. |IC|, c is given by Eq. (|3j). 
When a force / is applied, we have 



^cnd(^) = A(2£) / drpfXr) exp(/ • f/T), 



(43) 



where (r) is the probability distribution of the end-to- 
end distance in the absence of a force. Turning to angular 
coordinates we obtain 



2e„d(^)=A(2€)X,(//r), 



(44) 



where 



Xg{f/T) ^ S d0sin'^"2 / 

X pf(r) exp(/rcos(/)/T), (45) 

in which is a constant which depends on dimensionality. 
We assume that Pi{r) has the same scaling form as that 
of linear polymers 

peir) = R-^pir/R). (46) 
Here i? is a scaling length related to £ by 

R-^Rot, (47) 
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where v is the correlation length exponent of a linear 
polymer. 

We are interested in the behavior of p(a;) at x 3> 1 (see 
below). We assume that in this limit vix) takes a form 
similar to that corresponding to a linear polymer |29] 



f>{x) = Px^ exp{-Dx^). 



(48) 



Here P and D are constants, A = 1/(1 — ly) and the 
exponent /i is given by 



{d/2 + vd~c)/{l-v). 



(49) 



This result can be obtained by applying the same rea- 
soning used to derive the corresponding expression for 
linear polymers to the Y-fork configurations which are of 
interest here |29, |3^. Substituting into expression ( ^ ) 
we obtain 



liUlT) oc / d^sih 
/o 



X exp 



"-^(p / dxx''-^+>^ 
(^-Dx^/^ + UX COS cj)^ (50) 



where u = fR/T. This expression is valid provided the 
integral is dominated by large values of x, which is the 
case for u ^ 1. This appears to be the relevant regime for 
piconewton forces. In this situation we can evaluate the 
integral by steepest descent and obtain the saddle-point 
equations 



= 0; 



Xu 

U 



1/(A-1) 



(51) 
(52) 



They correspond to the non-Hookean elongation regime, 
described by de Gennes p. 47ff]. Note, that x* scales 
as u^^^-^~^'> and that A > 1. Since we are interested in 
the limit oi u ^ 1 this justifies our choice of using the 
tail of the distribution p{x). Therefore 

Mf/T) oc ^M(i--)+|(i-2.) (^AifRo/ry/^'e) , (53) 

where A is a constant. Using this expression we have 



OM.l.f [-'■'W''")!' . ,54, 

£=1 



Here z*{w) is the solution of Eq. (^ (corresponding to 
an infinitely long polymer). At this point the average 
length of a loop in the bulk is finite. Hence the unzipping 
transition is first order. 

Near the transition, the length ^ of the end segment di- 
verges like \z* — zu\~^, where z\j — e''^'^^"/'^^^'" / s. This 
is a result of the fact that the exponent /2 is smaller than 
1 (in fact it vanishes, as we have seen). Since z* is regular 
in/, wehave^^ IZ-Zul-i ov ^ ^ \T -T^{f)\-\ Thus 
the two strands separate gradually from the end as the 
critical force is approached. Nonetheless the unzipping 
transition is first order. The reason is that the transi- 
tion takes place at a temperature below the denaturation 
melting temperature Tm where the loop size distribution 
in the interior of the chain decays exponentially with the 
loop size. Thus at this point the average loop size in 
the interior of the chain is finite. On the other hand the 
length of the end segment is finite as long as / < /u and 
its contribution to the order parameter 6 and to the en- 
tropy is negligible. Therefore both the order parameter 
and the entropy exhibit a discontinuity to their values in 
the unzipped state at the transition. Let us remark that 
Eq. (Bq) implies that the critical force /u behaves like 



\T-T, 



Ml 



(57) 



where the exponent /i is given by 



as r — > Tm, at least as long as the forces are not too 
small (so that the it 3> 1 limit in ( ^o|) is valid). 

IV. SUMMARY 

In this paper we have extended the Poland-Scheraga 
type models introduced in the late fifties to fully take 
into account the effect of self-avoiding interactions on the 
DNA denaturation transition. We have shown that the 
model yields a first-order transition in agreement with 
experiments and with numerical simulations. Although 
the transition is thermodynamically first order it exhibits 
critical behavior in some of its properties, such as its loop 
size distribution and the length of the end segment. It 
would be of interest to study these properties experimen- 
tally to test these predictions. 

We have also studied the unzipping transition of DNA 
and found it to be first order. We have evaluated the 
behavior of the unzipping force near the melting point. 
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M = -c- ^(1 - I/) - -(1 - 2v). 
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APPENDIX A: PROPERTIES OF THE POLYLOG 
FUNCTION 

We summarize here a few elementary properties of the 
polylog function that are used in the text Sec. 1.11, 
p. 27ff.]. The polylog function <&c(-z) is defined by the 
series 



Since the second term in Eq. (Bl) is equal to the number 
of units in the end segment, and since this is the dominant 
term we conclude that almost all units belong to it in the 
thermodynamic limit. 

If both ends of the DNA chain are constrained to be 
bound, the partition function assumes the form 



e=i 



(Al) 



which converges for \z\ < 1. For |z| < 1 and 5Re c > it 
has the integral representation 



1 



r(c) 



dtt" 



1 



(A2) 



where r(c) is Euler's gamma function. From Eq. ( |A2| ) it 

is easy to see that ^c{z) diverges like |z — 1|^~^ for z ^ 1, 
if c < 1, and that, if c > 1 and 1 — 2; = e <C 1, one has 



$c(l) - $c(l - e) e'^ 



(A3) 



where the expon ent C is equal to min(l, c — 1). From the 
series definition (Al) it is also evident that 



(A4) 



APPENDIX B: HIGH-TEMPERATURE PHASE 

Above the transition, i.e., for w < wm, the fugacity z 
becomes zm — ^/s in the thermodynamic limit. In this 
Appendix we discuss in some detail how the thermody- 
namic limit is taken. 

The value of the fugacity for a chain of length L is 
obtained by solving Eq. (0) for the fugacity z. It reads 



Vo{z) " Q{z) 



y{z) U'{z)V{z) + U{z)V'{z) 



1 - U{z)V{z) 



(Bl) 

For w < wm, as z approaches zm, the first term is regular, 
while the second diverges as |z — zm|~^. In the third term 
the denominator does not vanish. The term may or may 
not diverge depending on whether U'{z) diverges at zm, 
namely, according to whether c is smaller or larger than 
2. However, in any case this term is much smaller than 
the second. We therefore have 



\z - zm\ 



(B2) 



VoHz) 



1 - U{z)Viz) 



(B3) 



Therefore the most singular term in Eq. (Bl) is absent. If 
c < 2, U'{z) diverges as \z — zmI'^"^, so that the equation 
corresponding to (Bl) can still be solved for any finite L. 
This means that the typical size of the denaturated loops 
increases with increasing L. 

In the case c > 2, on the other hand, the derivative of 
InZ with respect to Inz remains finite even for z = zm- 
The chain thus seems to have a maximum length, Lq (w) , 
corresponding to 2 = zm- Therefore, for longer chains 
one has to consider the total length constraint more ex- 
plicitly. To do that we introduce a maximal length cutoff, 
L, in the sums (||) and (^) defining U{z) and V{z), re- 
spectively. With this cutoff the number of terms in each 
of these series is finite, and thus they may be evaluated 
even for zs > 1. In this case the last term in Eq. (Bl) 
diverges when U{z)V{z) — 1 vanishes. Therefore, for any 
(L), and particularly for (i) > Lo{w), one can find a 
fugacity which satisfies 1 — U{z)V{z) 1/{L). Since for 
large L, U{z) grows exponentially as (sz)^ this imphes, 
to leading order in L, 



{sz)- ^ 1 - _ - 



a 

(l) 



1 



(B4) 



where a is a constant. For large L the right hand side of 
this equation approaches a constant, independent of L. 
This is due to the fact that for z larger but sufficiently 
close to Zm and at temperatures above the melting tem- 
perature V{z) is finite even for L going to infinity. In the 
limit L — !■ cx) the fugacity behaves as 



O 



(B5) 



Thus, in this case, z approaches its limiting value from 
above. 
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